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Abstract 



This paper suggests five measures of association between two random vectors 
X = (Xl, . . . , X p ) and Y = (Yy, . . . , Y q ). They are copula based and therefore 
invariant with respect to the marginal distributions of the components Xi and 
Yj. The measures capture positive as well as negative association of X and 
Y. In case p — q — 1 they reduce to Spearman's rho. Various properties 
of these new measures are investigated. Nonparametric estimators, based 
on ranks, for the measures are derived and their small sample behavior is 
investigated by simulation. The measures are applied to characterise strength 
and direction of association of bond and stock indices of five countries over 
^ ■ time. 
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1. Introduction 



Association of two random variables X and Y has been thoroughly inves- 
tigated in the statistical literature but much less work concerns association 
of two random vectors X = (X 1 , . . . , X p ) and Y = (Y 1; . . . , Y q ). In the classi- 
cal statistical framework of multivari ate norma l ity an d linear correlation the 
theory of cano nical corre l ation 
coefficient 



see 



sec 



Hotclling (1936 )) and the so called RV 



Escoufierl (119731 ) and iRobert and Escoufierl ( 119761 )) are well 



known and often applied, in particular in the natural sciences. 

The focus of our application, however, is financial dat a which is noto- 
riously non-normal. It has further been pointed out (see lEmbrechts et al 



( 120021 )) that linear correlation might be inappropriate to measure the strength 
of association of financial data. Finally the above mentioned measures are 
not capable of distinguishing between positive and negative association which 
is a must in dealing with financial data. 



The recently introduced distance correlation (see ISzekely et al.l (120071 ) 
and ISzekely and Rizzol ( 120091 )) suffers from similar weaknesses. Further it 
depends on the marginal distributions of X and Y and requires moment 
restrictions. This shoul d be considered a s a dis adva ntage in applicat ion to 



financial data (see again lEmbrechts et al.l ( 20021 ) and iRemillardl ( 20091 )) 



The increasing use of copulas in the analysis and modeling of financial 
data suggests the development of copula based measures of association. Con- 



cerning association within one vec t or X 
such measures (see 



(Xi, . . . ,X P 



there exist various 
Schmid et al.l (l2009h for a recent survey). In this pa- 



per, we propose copula based measures of association between two vectors 
X = (Xi, . . . , X p ) and Y = (Yi, . . . , Y q ) . They are invariant with respect to 
marginal distributions as association is solely determined by the joint copula 
of X and Y. Therefore distributional assumptions (beside continuity) as well 
as moment restrictions are not necessary. The measures are further capable 
of measuring negative association. 

The measures are defined in such a way that they reduce to Spearman's 
rank correlation coefficient for p — q — 1. Instead of Spearman's coefficient, 
other measures, such as Kendall's coefficient, could have been used. The 
details of this approach are very similar and are omitted. 

Note that it is not the aim o f this p aper to derive tests for independ ence 



of X and Y (see iBeran et al.l (120071 ) . iKojadinovic and Holmes! ( 120091 ) and 



Quessyl ( 120101 ) for recent contributions). On the contrary the focus is mea- 
surement of type and strength of association in the case where X and Y are 



dependent. 

The structure of the paper is as follows: In section [2J we describe the 
notation and terminology used within the paper. In section [3], we state pop- 
ulation versions of the five copula based measures of association and derive 
some of their properties. For convenience, some calculations are collected in 
the appendix. Estimators for the measures are introduced in section HI Their 
finite sample properties are analysed in a simulation study. Matlab code for 
the estimators is available on request. Section [5] contains an empirical ex- 
ample with financial data, illustrating the usefulness of the new measures. 
Section [6] concludes. 



2. Notation and definitions 



Let X = (Xi, . . . , X p ) and Y = (Y]_, . . . , Y q ) be random vectors of dimen- 
sions p and q, respectively, defined on the same probability space. Through- 
out the paper we assume that the marginal distribution functions F Xi for 
i ----- l,,.,,p and F Yi for j = 1 , . . . , a are continuous functions. There- 



fore, according to the theorem of ISklarl (119591 ) there exists a unique copula 

C : [0, l] p+g — ► [0, 1] with 



P (Xi < xi, . . . , X p < x p , Y\ < yi, . . . , Y q < y q ) 

= C(F Xl (x 1 ),...,F Xp (x 



for (x-i x v ) £ 

are given in 



and (ull.. ,V q ) m e 



P j , F Yl (yi) ,---,F Yq (y q )) 
q Extensive port r ayals of copulas 



Nelsenl fl2006h .1jo3 (J1997J 1 and ICherubini et al.l (l2004 ). Let 
U = (U u . . . , Up) = (F Xl (X t ) ,...,F Xp (X p )) 



and 



V 



(V u ...,V q ) = {F Yl (Y 1 ) J ...,F Yq (Y q )). 

The copula C is the joint distribution function of (U, V). The marginal 

copulas of X and Y are given by A (u) = C (u, l q ) and B (v) 

for u = (iti, . . . , Up) G [0, l] p and v = (vi, . . . , v q ) 6 [0, l] q . Here 

denote vectors of ones of length p and q, respectively. By construction, A 

and B are the marginal distribution functions of U and V, respectively. 



C(l p ,v) 
l p and l q 



3. Population versions of measures of association 

This section introduces population versions of five copula based measures 
of association between random vectors X and Y. 

3.1. Mean of pairwise association 

The simplest measure of association between X and Y is the mean of all 
bivariate associations of Xj and Yj for i = 1, . . . ,p and j — 1, . . . , q, i.e., 






Px < Y pq 

LH i=\ j=i 

where p^ is Spearman's rho of Xi and Yj. Note that p^ is copula based 
because of 



Pij = 12 / / Cij (ui, Vj) dui dvj - 3 

'[0,1] ^[0,1] 



and C^ is the marginal copula of Xi and Yj (see iNelsenl ( 120061 )). 

The properties of Pxy are easy to derive and follow directly from its 
definition. 

1. We have —1 < Pxy — 1 an d the measure is invariant with respect to 
permutations within X and Y. 

2. Pxy = — 1 ^ an d only if p^ = — 1 for every combination of i and j, 
i.e., Xi and Yj are countermonotonic for all i and j. This implies that 

X={h 1 (X),...,h p (X)) and Y=( gi (X),...,g q (X)) 

for a random variable X and strictly increasing functions hi, i = 1, . . . , p 
and strictly decreasing functions gj, j = 1, . . . , q, and it follows that 

C (u, v) = max {0, (min {ui, . . . , u p } + min {vi, . . . , v q } — 1)} . 

3. p x Y = 1 if and only if py = 1 for every combination i = 1, . . . ,p and 
j = 1, . . . , q. Here, Xj and Xj are comonotonic for all i and j. This 
implies that 

X=(h 1 (X),...,h p (X)) and Y=( gi (X),...,g q (X)) 



e 



for strictly increasing functions hi,i = 1, . . . ,p and gj, j = 1, . . . , q for 
a random variable X. The copula C is then 

C (u, v) = min {u u . . . , u p , v u . . . , uj . 

For given and fixed marginal copulas A and 5 it is in general not pos- 
sible to find a copula C with A(u) = C (u, l q ) and -B(v) = C (l p , i>) 
which entails p = — 1 or p = +1. The latter cases are only pos- 
sible in the special cases A(u) = min {u±, . . . , u p } and -B(v) = 
mm{vi,...,v q }. 

If Xj and Yj are independent for all combinations i = 1, . . . ,p and 
j = 1, . . . ,q then p x Y = 0. The converse is not true as there may be 
some pij different from zero, but p = 0: 

Example 1. Consider two 2- dimensional random vectors X = (X\, X2) 
and Y— (Yi,Y 2 ) with the following matrix of Spearman's rank correla- 
tions: 

(la b b \ 

a 1 —b—b 

b — b 1 o 

\ b -b a 1 / 

Choosing, for example, a = 0.6 and b = 0.4, the matrix is positive 
semi- definite and thus a correlation matrix. In this case, the random 
vectors X and Y are clearly dependent, whereas the mean of the pair- 
wise associations is equal to 

1 . 

P = T (Pl3 + Pl4 + P23 + P24j = 0. 

5. Association as measured by p can be decomposed into two parts that 
describe the association within and between the two random vectors. 
Let (X, Y) = Z = (Zi, . . . , Z p+q ) and let p z denote total association 
within Z defined by 

Pz= { 2 J ^ Plr 

l,r€{l,...,p+q} 

where p\ r denotes Spearman's rho of Z[ and Z r . Then 



p + q 



-lip i 



Yl Yl p ir+ J2 p ir+ J2 



1=1 r=p+l l<r l<r 

I, r <={!,..., p} l,r£{p+l,...,p+q} 




(I) , . 6) 



/p+^Px + T^yPY- 



within 



pg 



+ 



£UJ2- = i. 



( , r) r +, } r +, } 

This property might be useful since decomposition of p z into a between and 
within part can be interesting in analysing financial data. 

3.2. Pearson correlation based measures of association 
The vectors X and Y are independent if and only if 

C(u,v) =A(n)B(y) 

for u G [0, l] p and v G [0, l] 9 . Equivalently, X and Y are independent if and 
only if 

C (u, v) = P (U ^ u, V ^ v) = P (U > u) P (V ^ v) = A (u) 5 (v) 

for u G [0, l] p and v G [0, l] 9 . Therefore, we may measure the distance of C 
to the independence case of X and Y by 

/ / (C (u, v) - A (u) B (v)) dvdu. 

J[0,l] p J[0,l] q 

This expression is equal to 

cov(7r p (U),7r g (V)) 
where 



^ ( u ) = n (! - m ») and ^ ( v ) = n (! 

»=i i=i 



6 



•»i, 



for u e [0, l] p and v e [0, 1} 9 (see Appendix I for a proof). 

The covariance is bounded by the product of the respective standard de- 
viations. Thus, a measure of association between X and Y which is bounded 
by —1 and 1 is 

Pl (X,Y) = p 1 (C7)- cov MU),7r 3 (V)) 



^/var (w p (U)) v/var (7r g (V)) 
The variances may be expressed by 

var(7T p (U)) = / / (i(uAu') - A(u) A(u'))du'du, 

J[o,i] p J[o,i] p 

analogously for var (TT q (V)) (see Appendix I for a derivation). By u A u' := 
(min{wi, u[}, . . . , min{u p , u' p }) we denote the component wise minimum of u 
and u'. 

The measure p\ is based on the covariance of U and V transformed by 
the functions 7r p and n q . An alternative approach is to transform U and V 
by their respective distribution functions A and B (see iNelsen et al.l (120031 )) 
and consider the measure of association defined by 

/*(X,Y) = *«7)- »v W U), B (V), 



y/var{A{U))y/var{B{y))' 
It is shown in Appendix I that 

cov (A (U) , B (V)) = / / (C (u, v) - A (u) B (v)) dB (v) dA (u) 

and 

var (A (U)) = [ [ (I(uVu')-I (u) 3 (u')) rfA (u) dA (u') 

</[0,l] p i[0,l] 9 

with an analogous expression for var (B (V)) . By uVu' := (max-fwi, u^}, . . . , 
max{ii p ,Mp}) we denote component wise maximum of u and u'. 
The properties of p\ and p2 are as follows: 

1. We have — 1 < pi < 1 and — 1 < p2 < 1 and the measures are invariant 
with respect to permutations within X and Y. 



2. If the vectors X and Y are independent, pi = and p 2 = 0. Again, the 
converse is not true. 

3. Measures p\ and p 2 are in general not capable to achieve +1 or —1 for 
given and fixed arbitrary copulas A and B of X and Y. To calculate 
the maximal and minimal values of, for example, pi(X,Y) for given 
copulas A and B we have to maximise and minimise pi(X, Y) over all 
random vectors (X, Y), where the copula of X is A and the one of Y 
is B. For fixed A and B, the only term in the definition of pi(X, Y) 
that can vary is 

f C(u,v)dudv = E [7r p (U)7r g (V)] 

J[0,1]p+i 

= / P [tt p (U) > s, n q (V) > t] ds dt. 

J[0,1] 2 

The joint survival function of 7T p (U) and 7r g (V) will be maximal (min- 
imal) if they are comonotone (countermonotone): 



C(u, v) dudv 

[0,l]P+9 

> / max (P [tt p (U) > s] + P [tt p (V) > t] - 1,0) ds dt, 



/ C(u, v) dudv 

J[0,1]p+i 



< / min (P [tt p (U) > s],P [tt p (V) > t]) ds dt. 

'[o,i] 2 



Example 2. Consider two 2- dimensional random vectors X = {X\, X2) 
and Y = (Y\, Y 2 ) with copulas Cx(mi, W2) = n(wi, u 2 ) one? Cy(^i, ^2) = 
W(vi,v 2 ), where H(ui, U2) '■= uiu 2 is the independence copula and 
W(vi,v 2 ) := max{f! + v 2 — 1,0} the countermonotone copula. In this 
case 



P[7v 2 (U)>s] = P[(l-U 1 )(l-U 2 )>s] 

1 — S — U\ 



1-s 



P\U 2 < 

o V 1 - u i 

Ui "1,— : ) o«i 

o V ! - u i 

Y - s 1 - s - Mi 
o l-wi 

1 - s + slog(s), 



t/l = Mi ) dtii 



du\ 



and 



P[7T 2 (V)>t} = P[(l-V 1 ){l-V 2 )>t] 

= P[(i-K)K>t] 



2 w 'iuax<!--t,0!>. 



77ie upper bound is given by 



C(u, v) dudv 



[0,1]« 



</ min(P[^ 2 (^> s],P[tt 2 (V) > *]) rfsrft 

i[0,l] 2 



[0,1] 

767 
13824 



min 1 — s + slog(s), 2* /max < - — t, 



0.0555 



dsdt 



Thus, the maximal association between X and Y in terms of p\ is 

J|o il 4 i^( u ' v ) ~ n(u) W{v)) du dv 



Pl,max(-^ Y) 



A/var 7r 2 ( U) var 7r 2 ( V) 
767 _ 11 

< 13824 4 6 



30 36 Y 9 16 

« 0.8408 

Note that similar examples can be given for /9 2 - 

5.5. Rank correlation based measures of association 

Instead of applying Pearson correlation to n p (U) and n q (V) as in p\ or 
to A(U) and B(V) as in p 2 one may apply rank correlation, leading to the 
measures ps and p 4 . Let 

K MU) (t):=P(n p (U)^t) te[0,l], 
K Wq(v) (t):=P(n q (V)^t) tE[0,l] 



and 

K MV) (t):=P(A(U)^t) te[0,l], 
K B{v) (t):=P(B(V)^t) te[0,l] 

and let ^(u) := #ir p (U) I 71 "? (U)) and Z^v) := ^(v) (n q (V)), Z A (u) : = 
K A(U) (A (U)) and Z B(V) := tf B (v) (# (V)). From Z Wj((u) , Z Vq(y) ~ £/ [0, 1] it 
follows that 

C^ p(u) ,Z Iq(v) (M) = P (X P (U) < S, Z ff , (V ) < t) 

= io,lf /p,l]« 1 {^CO)(*p(»)X-} 1 {^CV)(ir«W)<*} dC ( U ' V ) 

is the joint distribution function and copula of Z 7rp ( U ) and Z nq (yy Implicitly 
since Za(jj), Z Bt y) ~ £7 [0, 1] , the copula 

C z MV) ,z B{v) (s,t) = P (Z A(U) < s, Z B(V) < t) 

[0il]P /o,i/ { ^ (u)(A(u)Ks}1{ ^ (v)(B(v)Kt} ^ (U ' V) 
10 



is the distribution function and copula of (Za(v)^b(V)) ■ Based on these 
copulas, we define the measures 



Ps(C) = pz (UJ.Z w = 12 / / C z z (s,t) dtds - 3, 

Jo Jo 



i.e., Spearman's rho of -^^(u) and -^(v) and 

p 4 (C) = 12 f f C Za(v)iZb ^ (s, t) dtds - 3, 
Jo Jo 

i.e., Spearman's rho of Za(xj) an d ^B(V)- Note that - contrary to the case in 
section I3T21 - normalisation is not necessary as it is impicitly ensured. 
Properties of p 3 and p 4 : 

1. We have — 1 ^ p% ^ +1 and —1 ^ p 4 ^ +1 and the measures are 
invariant with respect to permutations within X and Y. 

2. If X and Y are independent then C Znp{lJ) ,z nqm 0, t) = Cz A(u)iZfl(v) (s, t) = 
s£ and thus p 3 = p 4 = 0. Again, the converse is not true. 

3. p 3 = 1 is equivalent to 

^ p(U )A g (v)M)=min{s,t} 

and 

P (#ir pC u) (vr P (U)) = ^ (v) (tt, (V))) = 1. 

p 3 = — 1 is equivalent to 

Cz„ pQa) ,z„ gCV) ( s ' *) = max {« + t - 1, 0} 

and 

P (#MU) fa> (U)) = 1 - K Wq{v) (w q (V))) = 1. 

4. p 4 = 1 is equivalent to C z Zb , (s,t) = min{s,t} and 

P (X (u) (A (U)) = K B{y) (B (V))) = 1. 
p 4 = — 1 is equivalent to 

C Za(o),Zbcv) ( s ' *) = max {s + t-1,0} 



11 



and 

P (K A{V) (A (U)) = 1 - K B{V) (B (V))) = 1. 

5. Contrary to pi and P2, the rank correlation based measures p^ and p^ 
can achieve every value between —1 and +1 for any given and fixed 
marginal copulas A and B. To show this for p% and +1 let (S, T) be 
a comonotone pair of random variables such that S has the same dis- 
tribution as 7T p (U) and T has the same distribution as ir q (V). Second, 
conditionally on (S,T) = (s,t), let U' and V be independent random 
vectors whose conditional distributions are given by the one of U given 
7T P (U) = s and the one of V given ir q (V) = t, respectively. Now, the 
copulas of U' and V are A and B by construction and p% — 1 due to 
the comonotonicity of (S,T). Analogous arguments hold for p 3 = —1 
and p 4 = ±1. 

4. Statistical estimation of the measures 

In this section we propose nonparametric estimators of the discussed mea- 
sures. It is assumed that the marginal distribution functions F x . and Fy are 
unknown for i — 1, . . . ,p and j — 1, . . . , q. 

Let (Xi, Yi) , . . . , (X n , Y n ) be i.i.d. samples from (X, Y) . Let Fx un an d 
Fy- t n denote the empirical distribution function of Xi and Yj for i — 1, . . . ,p 
and j — 1, . . . , q. Then, for k — 1, . . . , n, 



Ufc,n — [F Xl ,n (Xik) , • • • , F Xp ,n (^pk) 



and 



Yfc, n — f F Yl! n (Xlk) , ■ ■ ■ , FY q ,n (Xqk) 

are called the pseudo observations of 

Ufc = (F Xl (Xi k ) ,■■■ , F Xp (X p k)) 

and 

V k =(F Yl (Yik),...,F Yq (y qk )). 



12 



The empirical distribution function of (Ufe jn , ^Vk,n) f° r k = 1, . . . ,n, i.e., 

1 - 

Cn(u,v) = -X.^U^u} 1 ^,^} 



is the empirical copula of (X, Y) (see, e.g.. lDeheuvelsl (I1979I )). The marginal 



empirical copulas of X and Y are estimated by 

A n (u) = C n (u, l q ) and B n (v) = C n (l p , v) . 
In the following, estimation is based on the pseudo observations 

(Ui,„, . . . , U ~ n ,n) and rVi >n , • • • , V„ iri J . 
Estimation of p x y- The estimator for p x y i s given by 



p n = p (Cn) = — ^ ^ ( 12 / C ijin (ttj, Vj) dui dvj - 3 J 
V * PQJ-^j^K i[o,i] 2 / 



'kj.n 

w i=\ j=\ \ \ k= i 

Estimation of p\. The estimator pi in for p\ is the Pearson coefficient of 
correlation of 



7T p ( Ui : „J , . . . , Tip ( \J n ,n) and n q ( Vl,„J , . . . , Tip ( V n ,, 

Estimation of p2- We first estimate yl and I? by 

A n (u) = C n (u, lg) and B n (v) = C n (l p , v) 
and obtain pseudo observations on A (U) and B (V) by 

A n (Ui, n ) , • • • , l n (U n , n ) and £ n (Vi,„) , . . . , B n (V r 



P2,n is the Pearson coefficient of correlation of these pseudo observations. 
Estimation of p%. In order to estimate p3, we first have to estimate 



13 



K * P (u) b y 

1 n 

K MV) , n (t) = -J2 1 {. P (u fc ,„)a} 

fc=l 

for t G [0, 1] and similarly K nq (yy Let 

1 n 

^r p (U),fc,n = K MV)in (7t p (Ufc.n)) = ~ J] 1 { ffj ,(u J ,„)< Wp (u M )} 

and 



1 n 
for /c = 1, . . . , n. Then with 



1=1 



■^ 1 . \ 

^p(U).^,(V),n (M) := ~Z^ {^ p (U),fc,n<s} {^<,(V),fe,n<*} 

k=l 

an estimator for p3 is 

P3,n = 12 / / CV p (U),7r 9 (V),n(M) CMS - 3 

Jo Jo 

1 n 
= U n 5J V 1 ~~ Z ^^)Xn) [ l - Z n q (V),k,n) - 3. 



fc=l 



Estimation of p 4 . The estimator p 4n for p 4 is derived in a similar way 
as p 3>n . Let 



and 



- 1 

fe=l 



1 n 
K B (V),n(t) = ~^2 1 { 



k=\ 



14 



Then 



KA(V),n ( Ai ( Ufe 



M(U) 



lA 

fc ' n ~~ n 1^ 1 {A n (u Ln )<:A n (lJ l 
1=1 



-)} 



and 



Kb(v),ii [B n ( V fcjn 



J B(V),k 



1 " 

z=i 



{B„(V !> „)<B„(V fei „)} 



for k = 1, ... ,n. For 



C 



^A(U).^S(V): ra V S ' 



1 - 

fe=l 



K{* 



S(V),fe,n 



<' 



it follows 

-1 /•! 



Jo Jo " fe=1 v 



A(U),k,n 



J B(V),k,n 



and thus 



Pi 



"(ito 



M(U),fc,n 



J B(V),k,n 



3. 



fe=i 



We have derived asymptotic normality of jo and pi using the asymptotic 
theory of the copula p roces s introduced by RiischendorJ (J1976I ) (see, e.g., 
Fermanian et al.l (J2004I ) and ISegersI ( accepted!) for recent refere n ces) a nd the 
functional delta method (see, e.g., IVan der Vaart and Wellnerl ( 1996I )). De- 
tails can be obtained from the authors. The asymptotic normality of j5i >n for 
i = 2,3,4 has not yet been derived. 

In the following, the finite sample properties of ~p n and pi >n , . . . , p^ n , i.e. 
their bias and standard deviation are investigated by a Monte Carlo simula- 
tion. In order to estimate the standard deviation of the estimators, we use 
the bootstrap and the jackknife. 

For a given sample (Xi, Yi), . . . , (X n , Y n ) of i.i.d. observations, the boot- 
strap draws n observations of the sample with replacement. Ties are solved 
by mid-ranks. For B bootstrap samples, the standard deviation is estimated 
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by 



a 



B 



\ 



B 



1 B 

6=1 



.(6) - 
Pn -P n 



M 



where p n is the estimator of the association of the 6-th bootstrap sample 
and p n their mean. 

For the jackknife estimator of the standard deviation of p n let p^-i de- 
note the estimator, where the j-th observation of (X l5 Yi), . . . , (X n , Y n ) is 
deleted. The jackknife estimate of the standard deviation is then given by 



a~ 



\ 



n 



n 



n 

E (/& 



Pn-l 



For the simulation study, we consider observatio ns from t he G aussian 



copula (see Ijod (119971 )) and the Clayton copula (see IClaytonl (119781 )) with 



different dimensions and different sample sizes. The Gaussian copula is de- 
fined by 

C (xii, . . . ,u d ) := $ e ($~Vi); • • • , ^M) , 

where $e is the distribution function of the multivariate normal distribution 
with zero mean, unit variances and positive definite correlation matrix = 
(9ij)ij = i^.. t d- Further, $ _1 denotes the quantile function of the univariate 
standard normal distribution. 



The d- dimensional Clayton copula (see IClaytonl ( 119781 )) is given by 



-1/6 



C e (ui,...,u n ) 



E 



U; 



-d+1 



for 9 > 0. 

To reduce the number of parameters in our model, we only consider the 
case of equi-correlation for the Gaussian copula, although simulations with 
more complex correlation matrices show similar results. The results are based 
on 10,000 Monte Carlo simulations and 500 bootstrap iterations, respectively. 

Tables CD to |3] show simulation results with the Gaussian copula for p, p 2 
and P4 for two random vectors of dimensions p = q = 3 and p = q = 4 and 
sample sizes 50, 100 and 500 as well as different correlation parameters 6. 
Results for the remaining measures and for the Clayton copula are omitted, 
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but can be obtained from the authors. They are, however, very similar to 
the results presented. 

The first two columns in the tables contain the value of the dependence 
parameters and the sample sizes. The third column of the tables shows an 
approximation to the true value of the measures of association, which has 
been derived from samples of size 1,000,000. Comparing the true values to 
the mean of the estimated associations m(p n ) in column 4, we observe a 
small finite sample bias, which decreases with increasing sample size. The 
standard deviation of the estimator s(p n ) and the means of the bootstrap 
estimation m(a B ) and the jackknife estimation m(a J ) are shown in colums 
5,6 and 7. It can be seen that both procedures for the estimation of the 
standard deviation perform well for the Gaussian copula. Furthermore, the 
standard deviation of the estimator decreases with increasing sample size in a 
reasonable way. Finally, columns 8 and 9 show that the standard error of the 
bootstrap standard deviation estimates is slightly smaller than the obtained 
jackknife estimates. 
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9 


n 


P 


m(p n ) 


sQ n ) 


m(a B ) 


m(a J ) 


s(a B ) 


s(a J ) 


Two 3-dimensional vectors 


-0.1 


50 


-0.096 


-0.094 


0.040 


0.039 


0.040 


0.007 


0.007 




100 


-0.096 


-0.095 


0.027 


0.028 


0.028 


0.003 


0.003 




500 


-0.096 


-0.096 


0.012 


0.012 


0.012 


0.001 


0.001 


0.2 


50 


0.191 


0.188 


0.063 


0.063 


0.064 


0.008 


0.008 




100 


0.191 


0.189 


0.045 


0.044 


0.045 


0.004 


0.004 




500 


0.191 


0.191 


0.020 


0.020 


0.020 


0.001 


0.001 


0.5 


50 


0.483 


0.474 


0.069 


0.070 


0.071 


0.007 


0.008 




100 


0.483 


0.479 


0.049 


0.049 


0.049 


0.004 


0.004 




500 


0.483 


0.482 


0.022 


0.022 


0.022 


0.001 


0.001 


Two 4-dimensional vectors 


-0.1 


50 


-0.096 


-0.094 


0.028 


0.027 


0.028 


0.005 


0.005 




100 


-0.096 


-0.095 


0.019 


0.019 


0.020 


0.003 


0.003 




500 


-0.096 


-0.095 


0.009 


0.009 


0.009 


0.001 


0.001 


0.2 


50 


0.191 


0.188 


0.054 


0.054 


0.055 


0.007 


0.007 




100 


0.191 


0.189 


0.038 


0.038 


0.039 


0.004 


0.004 




500 


0.191 


0.191 


0.017 


0.017 


0.017 


0.001 


0.001 


0.5 


50 


0.483 


0.474 


0.064 


0.064 


0.065 


0.006 


0.007 




100 


0.483 


0.478 


0.045 


0.045 


0.046 


0.003 


0.003 




500 


0.483 


0.481 


0.020 


0.020 


0.020 


0.001 


0.001 



Tabic 1: Simulation results for a equicorrclated Gaussian copula with correlation 9 and 
sample size n. ~p denotes the theoretical value. The empirical means m(-) and the empirical 
standard deviations s(-) are based on 10,000 Monte Carlo simulations and 500 bootstrap 
samples. The bootstrap estimates arc labeled by the superscript B, the jackknife estimates 
by J. 



9 


n 


92 


m {f>2,n) 


S (P2,n) 


m(a B ) 


m(d- J ) 


s(a B ) 


s(a J ) 


Two 3-dimensional vectors 


-0.1 


50 


-0.225 


-0.218 


0.104 


0.113 


0.110 


0.029 


0.038 




100 


-0.225 


-0.220 


0.071 


0.072 


0.073 


0.018 


0.021 




500 


-0.225 


-0.225 


0.031 


0.031 


0.031 


0.005 


0.005 


0.2 


50 


0.349 


0.335 


0.146 


0.143 


0.155 


0.021 


0.030 




100 


0.349 


0.343 


0.103 


0.102 


0.106 


0.012 


0.015 




500 


0.349 


0.347 


0.046 


0.046 


0.046 


0.003 


0.003 


0.5 


50 


0.682 


0.664 


0.095 


0.098 


0.100 


0.021 


0.025 




100 


0.682 


0.673 


0.066 


0.066 


0.067 


0.012 


0.012 




500 


0.682 


0.681 


0.029 


0.029 


0.029 


0.003 


0.002 


Two 


4-dimensional vectors 












-0.1 


50 


-0.235 


-0.226 


0.073 


0.123 


0.087 


0.025 


0.033 




100 


-0.235 


-0.231 


0.048 


0.058 


0.054 


0.013 


0.015 




500 


-0.235 


-0.234 


0.021 


0.021 


0.021 


0.003 


0.003 


0.2 


50 


0.388 


0.370 


0.157 


0.151 


0.168 


0.025 


0.045 




100 


0.388 


0.378 


0.110 


0.108 


0.115 


0.016 


0.023 




500 


0.388 


0.387 


0.050 


0.049 


0.050 


0.004 


0.004 


0.5 


50 


0.723 


0.666 


0.094 


0.097 


0.099 


0.021 


0.026 




100 


0.723 


0.674 


0.065 


0.066 


0.067 


0.011 


0.012 




500 


0.723 


0.721 


0.028 


0.028 


0.028 


0.003 


0.003 



Table 2: Simulation results for a equicorrelatcd Gaussian copula with correlation 9 and 
sample size n. p2 denotes the approximation of the theoretical value, estimated from a 
sample with sample size 100,000,000. The empirical means m(-) and the empirical standard 
deviations s(-) are based on 10,000 Monte Carlo simulations and 500 bootstrap samples. 
The bootstrap estimates are labeled by the superscript B, the jackknife estimates by J. 
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9 


n 


P4 


m(p4, n ) 


S (h,n) 


m(a B ) 


m(d- J ) 


s(a B ) 


s(a J ) 


Two 3-dimensional vectors 


-0.1 


50 


-0.310 


-0.250 


0.126 


0.133 


0.147 


0.010 


0.017 




100 


-0.310 


-0.277 


0.091 


0.089 


0.100 


0.006 


0.008 




500 


-0.310 


-0.302 


0.041 


0.039 


0.042 


0.002 


0.001 


0.2 


50 


0.375 


0.330 


0.123 


0.118 


0.135 


0.012 


0.016 




100 


0.375 


0.353 


0.086 


0.083 


0.092 


0.007 


0.008 




500 


0.375 


0.369 


0.039 


0.038 


0.040 


0.002 


0.002 


0.5 


50 


0.694 


0.635 


0.081 


0.084 


0.090 


0.015 


0.019 




100 


0.694 


0.665 


0.057 


0.057 


0.060 


0.008 


0.010 




500 


0.694 


0.688 


0.025 


0.025 


0.025 


0.002 


0.002 


Two 


4-dimensional vectors 












-0.1 


50 


-0.462 


-0.265 


0.110 


0.150 


0.144 


0.011 


0.021 




100 


-0.462 


-0.337 


0.085 


0.101 


0.102 


0.007 


0.010 




500 


-0.462 


-0.428 


0.038 


0.036 


0.040 


0.002 


0.002 


0.2 


50 


0.433 


0.356 


0.118 


0.116 


0.137 


0.012 


0.017 




100 


0.433 


0.394 


0.085 


0.080 


0.092 


0.007 


0.009 




500 


0.433 


0.426 


0.037 


0.036 


0.038 


0.002 


0.002 


0.5 


50 


0.740 


0.667 


0.077 


0.080 


0.086 


0.015 


0.020 




100 


0.740 


0.704 


0.052 


0.052 


0.055 


0.008 


0.010 




500 


0.740 


0.733 


0.022 


0.022 


0.023 


0.002 


0.002 



Tabic 3: Simulation results for a equicorrclatcd Gaussian copula with correlation 9 and 
sample size n. p4 denotes the approximation of the theoretical value, estimated from a 
sample with sample size 100,000,000. The empirical means m(-) and the empirical standard 
deviations s(-) are based on 10,000 Monte Carlo simulations and 500 bootstrap samples. 
The bootstrap estimates are labeled by the superscript B, the jackknife estimates by J. 
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We further investigate how well the finite sample distribution of the intro- 
duced empirical measures of association can be approximated by the normal 
distribution. To this end, we compute p,px,p2,p3 and p 4 for iV =10,000 
Monte Carlo simulations from two 2-dimensional random vectors from the 
Gaussian copula and the Clayton copula. For each copula, we use different 
dependence parameters and various sample sizes. We standardise the 10,000 
measures obtained from the Monte Carlo simulation by their sample mean 
and standard deviation, respectively, and use a kernel estimator to approxi- 
mate their density. 

The left panel of figure [1] shows the results for p 3 in case of the Gaussian 
copula, where the correlation matrix has the form 



/I 


0.5 


Q 


q\ 


0.5 


1 


Q 


Q 


e 


Q 


1 


0.5 


\e 


Q 


0.5 


!/ 



with q = —0.75,0 and 0.75. Whereas the density of the estimator is highly 
skewed for g = —0.75 and g = 0.75 for a sample size of 50, this asymmetry 
vanishes with increasing sample size and the density of the p^ for all of the 
considered values of g is barely distinguishable from the normal density for 
a sample size of 500. It has to be noted that —0.75 and 0.75 are the upper 
and lower bound of g such that S is a correlation matrix. For values closer 
to 0, the asymmetries are smaller. 

The right panel of figure [T] shows the results for a Clayton Copula with 
parameters a = 0.5,2 and 5. Again, we used p3 to measure the association. 
The other measures, however, show similar results, which are available upon 
request by the authors. For a sample size of 50, the density of the estimator 
is slightly skewed for all parameters, nevertheless the highest skewness occurs 
for a = 5. As for the Gaussian copula, the skewness decreases with increasing 
sample size and is barely observable for a sample size of 500. 

Having performed similar Monte Carlo simulations for other dimensions 
and dependence parameters, we conclude that for a sample size of 500 the 
finite sample distribution of the association measure can very well be approx- 
imated by the normal distribution. 
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Figure 1: Kernel density estimates of the normalised estimator for ps for the Gaussian 
copula with g = —0.75, and 0.75 (left panel) and the 4 dimensional Clayton Copula with 
a = 0.5,2 and 5 (right panel). The solid line depicts the standard normal distribution. 



5. Empirical example 

In our empirical example we make an attempt to measure strength and 
direction of association of the bond and the stock market. We make use of the 
five copula based measures of association presented in section [3J Moreover, 
for the sake of comparison, the traditional canonical correlation, the RV 
coefficient and distance correlation are applied. We consider daily returns 
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of the stock market indices of five major countries 1 as well as government 
bonds indices from The Bank of America Merrill Lynch 2 for the respec- 
tive countries during the period from January 3rd, 1996 to December 30th, 
2010. Figure [2] shows the evolution of the association of bond and stock 
market, based on a forward-looking moving window with a window size of 
250 days. In particular, the first value of each measure is based on the 250 
daily returns following January, 2rd, 1996. The last value is estimated from 
the 250 daily returns from January 18th, 2010 until the end of 2010. The 
top panel shows that canonical correlation, distance correlation and the RV 
coefficient exhibit similar patterns of association. Their scaling is quite differ- 
ent, however. Canonical correlation is always highest, RV coefficient always 
lowest. Distance correlation is somehow between the two, but closer to the 
canonical correlation in general. The evolution of association over time as 
indicated by the five measures p and pi, . . . , p 4 is shown in the bottom panel. 
The differences between their values are in general smaller than these of the 
aforementioned measures. The evolution of p seems to be smoothest which 
is not surprising, while p% is most erratic. It can be seen that there is a 
tendency of decreasing association from 1996 to 2002, association is close to 
zero between 2002 and 2006 and becomes negative afterwards. A pattern of 
association like this can only be recorded by using measures of the type which 
we introduced in section [3j Note that the identified pattern of association 
is an empirical finding, for which we do not attempt to provide an economic 
explanation. The graphs in figure [2] are based on the returns themselves. We 
have, however, made similar graphs for filtered data where autocorrelation 
and heteroscedasticity have been removed. The results for the filtered data 
are very close to those of the unfiltered data. 

6. Conclusion 

Five measures of association between two random vectors X and Y have 
been introduced. They are copula based and do therefore not depend on the 



1 A11 Ordinaries (Australia), CAC 40 (France), DAX (Germany), Nikkei 225 (Japan) 
and S&P 500 (USA) 

2 The BofA Merrill Lynch Australia Government Index (GOTO), The BofA Merrill Lynch 
French Government Index (G0F0), The BofA Merrill Lynch Japan Government Index 
(G0Y0), The BofA Merrill Lynch German Government Index (G0D0) and The BofA 
Merrill Lynch Australia Government Index (GOTO) 
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Figure 2: Evolution of the association between the bonds and stock markets, measured by 
their canonical correlation, their distance correlation and their RV coefficient (top panel) 
and by the introduced measures of association ~p and p\,...,pi (bottom panel). The 
analysis is based on a moving window approach with a window size of 250 days. 



marginal distributions of the components Xi and Yj. They measure strength 
and direction of association, so they are capable of distinguishing positive and 
negative association. This is a substantial advantage in applications to real 
life data, in particular financial data. Estimators for the measures have been 
proposed and it was demonstrated by simulation that they have favorable 
small sample properties at least for n > 100. There is space for extension 
and complementation of the measures. First, it can be seen that the measures 
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Pi, . . . , P4 have in common that they are based on transformations /(U) and 
g(V), say, where / and g may, or may not, depend on the marginal copulas A 
and B. Therefore more general classes of measures can be defined, if further 
measures of bivariate association are applied to /(U) and g(V). 

Second, measures for association between d vectors Xi,X 2 , X 3 . . . , X^ 
of dimensions Pi,P2, ■ ■ ■ ,Pd, r espectively, can be defined if measures of d- 
variate association (see, e.g., ISchmid and Schmidtl (120071 )) are applied to 



/i(Ui), / 2 (U 2 ), . . . , fd(Ud) for appropriate functions /i, . . . , f d - 
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8. Appendix I 

1. Let n p (U) = Uli (1 " U t ) and n q (V) = nj =1 (1 - VA 
Then 

cov (tt p (U) , n q (V)) = E c (ti p (U) , n q (V)) - E A (n p (U)) E B (^ (V)) 

= / / Ti p (u) n q (v) dC(u, v) - / Tip (u) oL4(u) / n q (v) dB(i 

J[0,1]p J[0,l]4 ^[0,l] p "'[O,!] 9 



C(u, v)dudv — / A(u)du / B(v)dv 
[0,1]p J [0,l]i ^[o,i] p ^[o,i] 9 

(C(u,v)-A(u)B(v))d(u,v) 

[0,1]P ^[0,1] 9 

2. Consider A (U) = F x (X) and B (V) = F Y (Y) . 
We then have 

cov (A (U) , B (V)) = E c (A (U) B (V)) - E A (A (U)) E B (B (V)) 
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and 



E C (A(V)B(V)) = [ [ A(u)B(v)dC(u,v) 

J\0,1]p J\0,l]i 



[0,1]p J[0,l]i 

C ( u, v) dA (u) dB (v) 



'[0,1]p J[0,l]i 

Further 

E A (A(U))= f A(u)dA(u)= f A(u)dA(u) 

and 



E B (B(U))= B(u)dB(u)= B(u)dB(u). 

J[0,1]Q J[0,l] q 

Therefore 
cov(A(U),5(V)) = / / (C(u,v)-A(u)B(v))dA(u)dB(v) 

The measure to be defined is therefore based on the weighted difference 

C (u, v) - A (u) B (v) = P (U > u, V > v) - P (U > u) p (V > v) 

where the weights are given by A and B. 
Further 

var(A(U)) = E A (A(U)A(U))-(E A (A(U))) 2 

A (u) A (u) dA (u) - ( / A (u) rfA (u) 

A (u) A (V) dA (u A u') 

[0,1]p J[0,1]p 

f A(u)A (u) dA (u) dA (u) 

[o,i]p J[0,1]p 



f f (A (u V u') - A (u) A (u')) ^ (u) dA (u) 
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